An Analysis of Feature Selection and Reward Function for Model-Based Reinforcement Learning
Abstract
In this paper, we propose a series of correlation-based feature selection methods for dealing with high dimensionality in feature-rich environments for model-based Reinforcement Learning (RL). Real-world RL tasks usually involve high-dimensional feature spaces, where standard RL methods often perform badly. Our proposed approach adopts correlation among state features as a selection criterion. The effectiveness of the proposed methods is compared against previous methods, referred to as 10PreviousFS [2], using data from an intelligent logic tutor called Deep Thought (DT) [1]. We evaluated the different feature selection methods by expected cumulative reward (ECR) [3], considering two types of reward: immediate and delayed. Our results show that the proposed methods significantly outperform the previous feature selection methods with both types of reward. Moreover, the “best” policy induced using immediate reward differs significantly from the one induced using delayed reward.
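The abstract does not define ECR, so the following is only a minimal sketch under stated assumptions: it takes ECR to be the average of the optimal state values V*(s), weighted by how often each state occurs as a start state in the training data, and it obtains V* by value iteration on a small tabular MDP. The function names, the toy transition and reward arrays, and the discount factor are all hypothetical and are not taken from the paper or from the Deep Thought data.

```python
import numpy as np

def value_iteration(P, R, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Tabular value iteration (illustrative sketch).

    P[a, s, t]: probability of moving from state s to state t under action a.
    R[a, s, t]: reward received on that transition.
    Returns the optimal state values and a greedy policy.
    """
    n_actions, n_states, _ = P.shape
    V = np.zeros(n_states)
    for _ in range(max_iter):
        # Q[a, s] = sum_t P[a, s, t] * (R[a, s, t] + gamma * V[t])
        Q = np.einsum("ast,ast->as", P, R + gamma * V)
        V_new = Q.max(axis=0)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    return V, Q.argmax(axis=0)

def expected_cumulative_reward(V, start_counts):
    """Assumed ECR definition: start-state-frequency-weighted mean of V*."""
    w = np.asarray(start_counts, dtype=float)
    return float(np.dot(w / w.sum(), V))

# Hypothetical toy MDP: 2 states, 2 actions. State 1 is absorbing.
P = np.zeros((2, 2, 2))
R = np.zeros((2, 2, 2))
P[0, 0, 1] = 1.0   # action 0 in state 0: move to state 1 ...
R[0, 0, 1] = 1.0   # ... and collect reward 1
P[1, 0, 0] = 1.0   # action 1 in state 0: stay, no reward
P[:, 1, 1] = 1.0   # state 1 loops on itself under both actions

V_star, policy = value_iteration(P, R, gamma=0.9)
ecr = expected_cumulative_reward(V_star, start_counts=[3, 1])
print(V_star, policy, ecr)
```

Under this assumed definition, two feature sets can be compared by inducing a policy from each resulting state representation and preferring the one with the higher ECR; whether the reward array encodes immediate or delayed reward changes V* and can therefore change which policy looks best.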
Similar resources
RRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features
The principal aim of a search engine is to provide sorted results according to the user’s requirements. To achieve this aim, it employs ranking methods to rank web documents based on their significance and relevance to the user query. The novelty of this paper is a user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...
Compatible Reward Inverse Reinforcement Learning
PROBLEM
• Inverse Reinforcement Learning (IRL) problem: recover a reward function explaining a set of expert’s demonstrations.
• Advantages of IRL over Behavioral Cloning (BC):
  – Transferability of the reward.
• Issues with some IRL methods:
  – How to build the features for the reward function?
  – How to select a reward function among all the optimal ones?
  – What if no access to the environment? ...
Reinforcement learning based feedback control of tumor growth by limiting maximum chemo-drug dose using fuzzy logic
In this paper, a model-free reinforcement learning-based controller is designed to extract a treatment protocol because the design of a model-based controller is complex due to the highly nonlinear dynamics of cancer. The Q-learning algorithm is used to develop an optimal controller for cancer chemotherapy drug dosing. In the Q-learning algorithm, each entry of the Q-table is updated using data...
Novel Radial Basis Function Neural Networks based on Probabilistic Evolutionary and Gaussian Mixture Model for Satellites Optimum Selection
In this study, two novel learning algorithms are applied to a Radial Basis Function Neural Network (RBFNN) to approximate functions of high non-linear order. The Probabilistic Evolutionary (PE) and Gaussian Mixture Model (GMM) techniques are proposed to significantly reduce the error functions. The main idea concerns the various strategies to optimize the procedure of Gradient ...
Feature Selection for Inverse Reinforcement Learning
We explore the problem of feature selection for inverse reinforcement learning in cases where the true reward is sparse in terms of features. We recover such structures by using a sparsity-inducing L1-norm in the problem formulation, and explore different techniques for efficiently solving the resulting non-smooth convex minimization problem. Our results indicate that the sparse problem formula...